Statistically-driven generation of multidimensional analytical schemas from linked data
نویسندگان
چکیده
The ever-increasing Linked Data (LD) initiative has given place to open, large amounts of semi-structured and rich data published on the Web. However, effective analytical tools that aid the user in his/her analysis and go beyond browsing and querying are still lacking. To address this issue, we propose the automatic generation of multidimensional analytical stars (MDAS). The success of the multidimensional (MD) model for data analysis has been in great part due to its simplicity. Therefore, in this paper we aim at automatically discovering MD conceptual patterns that summarize LD. These patterns resemble the MD star schema typical of relational data warehousing. The underlying foundations of our method is a statistical framework that takes into account both concept and instance data. We present an implementation that makes use of the statistical framework to generate the MDAS. We have performed several experiments that assess and validate the statistical approach with two well-known and large LD sets.
منابع مشابه
Automatic validation of requirements to support multidimensional design
Article history: Received 7 October 2008 Received in revised form 9 March 2010 Accepted 11 March 2010 Available online 27 March 2010 It iswidely accepted that the conceptual schemaof a datawarehousemust be structuredaccording to themultidimensionalmodel. Moreover, it has been suggested that the ideal scenario for deriving the multidimensional conceptual schema of the datawarehousewould consist ...
متن کاملMultidimensional Schemas Quality Assessment
A data warehouse is a database focused on decision making. It is built separately from the transactional (OLTP) databases of the enterprise, although it is partly fed from transactional data. Data warehouses are typically accessed by decision makers using OLAP tools, based on a specific, multidimensional representation of data. Considering the strategic importance of data warehouses, the qualit...
متن کاملA Metadata-based Recommender System for Statistical Linked Open Data
In recent years, there are increasing efforts of Business Intelligence (BI) and Semantic Web communities to enable On-Line Analytical Processing (OLAP) over Statistical Linked Open Data. Unlike internal sources where data organization is generally familiar, Linked Data sources are typically uncontrolled and bring challenges regarding the integrity constraints and data completeness required by t...
متن کاملAn Automatic Method for the Design of Multidimensional Schemas from Object Oriented Databases
A data warehouse (DW) is a large data repository system designed for decision-making purposes. Its design relies on a speci ̄c model called multidimensional. This multidimensional model supports analyses of huge volumes of data that trace the enterprise's activities over time. Several design methods were proposed to build multidimensional schemas from either the relational data model or the enti...
متن کاملThe Conceptual Integration Modeling Framework: Abstracting from the Multidimensional Model
Data warehouses are overwhelmingly built through a bottom-up process, which starts with the identification of sources, continues with the extraction and transformation of data from these sources, and then loads the data into a set of data marts according to desired multidimensional relational schemas. End user business intelligence tools are added on top of the materialized multidimensional sch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Knowl.-Based Syst.
دوره 110 شماره
صفحات -
تاریخ انتشار 2016